Parametrisation of the speech space using the self-organising neural network

نویسندگان

  • Roberto Togneri
  • Yianni Attikiouzel
چکیده

Speech recognition is a diicult problem due to the inability of current systems to cope with connected speech. Neural networks are able to learn some aspects of this task. An unsupervised learning scheme like the self-organising map can be used to both classify and order the speech sounds and provide a front end to higher level processing. A map of phonemes (phonotopic map) is used to trace trajectories of sounds from utterances. The self-organising map provides a means of reducing the inherent dimensionality of the speech data. A crinkle factor which is used to determine how close the dimensionality of the map is to the dimensionality of the speech input shows that speech has an inherent dimensionality of at least three or four. A projection of the map and the speech data shows how the self-organising map ts the speech space.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Processing Using Artificial Neural Networks

A three layer perceptron network is used to classify the /i/ sound using isolated words from diierent speakers. A classiication accuracy of 97% has been achieved. A map of phonemes is used to trace trajectories of utterances using the self-organising neural network. A crinkle factor is proposed which allows using the self-organising map to determine the inherent dimensionality of a set of point...

متن کامل

Generalisation and discrimination emerge from a self-organising componential network: a speech example

It is demonstrated that a componential code emerges when a self-organising neural network is exposed to continuous speech. The code’s components correspond to substructures that occur relatively independently of one another: words and phones. A capability for generalisation and discrimination develops without having been optimised explicitly. The componential structure is revealed by optimising...

متن کامل

بهبود عملکرد سیستم بازشناسی گفتار پیوسته بوسیله ویژگی‌های استخراج شده از مانیفولدهای گفتاری در فضای بازسازی شده فاز

The design for new feature extraction methods out of the speech signal and combination of their obtained information is one of the most effective approaches to improve the performance of automatic speech recognition (ASR) system. Recent researches have been shown that the speech signal contains nonlinear and chaotic properties, but the effects of these properties are not used in the continuous ...

متن کامل

Developing A Fault Diagnosis Approach Based On Artificial Neural Network And Self Organization Map For Occurred ADSL Faults

Telecommunication companies have received a great deal of research attention, which have many advantages such as low cost, higher qualification, simple installation and maintenance, and high reliability. However, the using of technical maintenance approaches in Telecommunication companies could improve system reliability and users' satisfaction from Asymmetric digital subscriber line (ADSL) ser...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1990